NSF PAR Search | NSF Public Access Repository

AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction

https://doi.org/10.18653/v1/2025.emnlp-main.584

Wang, Song; Tan, Zhen; Chen, Zihan; Zhou, Shuang; Chen, Tianlong; Li, Jundong (November 2025, Association for Computational Linguistics)

Full Text Available
FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference

Wang, Dongwei; Liu, Zijie; Wang, Song; Ren, Yuxin; Deng, Jianing; Hu, Jingtong; Chen, Tianlong; Yang, Huanrui (November 2025, Association for Computational Linguistics)

Full Text Available
Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework

Chen, Yuhang; Tan, Zhen; Jaiswal, Ajay; Qu, Huaizhi; Zhao, Xinyu; Lin, Qi; Cheng, Yu; Kwong, Andrew; Cao, Zhichao; Chen, Tianlong (November 2025, Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing)

Full Text Available
GraphRCG: Self-Conditioned Graph Generation

https://doi.org/10.1145/3746252.3761205

Wang, Song; Tan, Zhen; Zhao, Xinyu; Chen, Tianlong; Liu, Huan; Li, Jundong (November 2025, ACM)

Full Text Available
SCALE: Towards Collaborative Content Analysis in Social Science with Large Language Model Agents and Human Intervention

https://doi.org/10.18653/v1/2025.acl-long.416

Zhao, Chengshuai; Tan, Zhen; Wong, Chau-Wai; Zhao, Xinyan; Chen, Tianlong; Liu, Huan (July 2025, Association for Computational Linguistics)

Full Text Available
Adapt-∞: Scalable Lifelong Multimodal Instruction Tuning via Dynamic Data Selection

Maharana, Adyasha; Yoon, Jaehong; Chen, Tianlong; Bansal, Mohit (April 2025, Proceedings of the International Conference on Learning Representations)

Full Text Available
DLF: Disentangled-Language-Focused Multimodal Sentiment Analysis

https://doi.org/10.1609/aaai.v39i20.35416

Wang, Pan; Zhou, Qiang; Wu, Yawen; Chen, Tianlong; Hu, Jingtong (April 2025, Proceedings of the AAAI Conference on Artificial Intelligence)
Multimodal Sentiment Analysis (MSA) leverages heterogeneous modalities, such as language, vision, and audio, to enhance the understanding of human sentiment. While existing models often focus on extracting shared information across modalities or directly fusing heterogeneous modalities, such approaches can introduce redundancy and conflicts due to equal treatment of all modalities and the mutual transfer of information between modality pairs. To address these issues, we propose a Disentangled-Language-Focused (DLF) multimodal representation learning framework, which incorporates a feature disentanglement module to separate modality-shared and modality-specific information. To further reduce redundancy and enhance language-targeted features, four geometric measures are introduced to refine the disentanglement process. A Language-Focused Attractor (LFA) is further developed to strengthen language representation by leveraging complementary modality-specific information through a language-guided cross-attention mechanism. The framework also employs hierarchical predictions to improve overall accuracy. Extensive experiments on two popular MSA datasets, CMU-MOSI and CMU-MOSEI, demonstrate the significant performance gains achieved by the proposed DLF framework. Comprehensive ablation studies further validate the effectiveness of the feature disentanglement module, language-focused attractor, and hierarchical predictions.
Full Text Available
Generate, Then Retrieve: Addressing Missing Modalities in Multimodal Learning via Generative AI and MoE

Yun, Sukwon; Xin, Jiayi; Choi, Inyoung; Peng, Jie; Ding, Ying; Long, Qi; Chen, Tianlong (March 2025, aUpA5gulZ4)

In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks such as in medical contexts where data may be incomplete. Although some attempts have been made to retrieve embeddings for missing modalities, two main bottlenecks remain: (1) the need to consider both intra- and inter-modal context, and (2) the cost of embedding selection, where embeddings often lack modality-specific knowledge. To address this, the authors propose MoE-Retriever, a novel framework inspired by Sparse Mixture of Experts (SMoE). MoE-Retriever defines a supporting group for intra-modal inputs—samples that commonly lack the target modality—by selecting samples with complementary modality combinations for the target modality. This group is integrated with inter-modal inputs from different modalities of the same sample, establishing both intra- and inter-modal contexts. These inputs are processed by Multi-Head Attention to generate context-aware embeddings, which serve as inputs to the SMoE Router that automatically selects the most relevant experts (embedding candidates). Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
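The router step described in this abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`route_top_k`, `retrieve_embedding`), the dot-product scoring, and the toy vectors are all assumptions made for illustration, and the Multi-Head Attention stage is taken as already having produced the context-aware embedding passed in as `context`.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(context, expert_keys, k=1):
    """Score each expert against the context embedding and keep the top k.

    Assumption: scoring is a plain dot product between the context vector
    and one key vector per expert (hypothetical; the abstract does not
    specify the scoring function).
    Returns (expert_index, renormalized_weight) pairs, best first.
    """
    logits = [sum(c * w for c, w in zip(context, key)) for key in expert_keys]
    probs = softmax(logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in ranked)
    return [(i, probs[i] / norm) for i in ranked]


def retrieve_embedding(context, expert_keys, candidates, k=1):
    """Combine the selected embedding candidates using the router weights."""
    selected = route_top_k(context, expert_keys, k)
    dim = len(candidates[0])
    return [sum(w * candidates[i][d] for i, w in selected) for d in range(dim)]


# Toy data (hypothetical): three experts, two-dimensional embeddings.
context = [1.0, 0.0]
expert_keys = [[0.0, 1.0], [2.0, 0.0], [1.0, 0.0]]
candidates = [[1.0, 1.0], [2.0, 2.0], [3.0, 3.0]]

top1 = route_top_k(context, expert_keys, k=1)
retrieved = retrieve_embedding(context, expert_keys, candidates, k=1)
```

With `k=1` the router keeps only the highest-scoring expert, so the retrieved embedding is that expert's candidate unchanged; a larger `k` blends several candidates by their renormalized router weights.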
Full Text Available
You Only Debias Once: Towards Flexible Accuracy-Fairness Trade-offs at Inference Time

Han, Xiaotian; Chen, Tianlong; Zhou, Kaixiong; Jiang, Zhimeng; Wang, Zhengyang; Hu, Xia (February 2025, The Second Conference on Parsimony and Learning)

Full Text Available
Sparse MoE as a New Retriever: Addressing Missing Modality Problem in Incomplete Multimodal Data

Yun, Sukwon; Xin, Jiayi; Choi, Inyoung; Peng, Jie; Long, Qi; Chen, Tianlong (February 2025, ICLR 2025 https://openreview.net/forum?id=j9DbobO0mY)

In multimodal machine learning, effectively addressing the missing modality scenario is crucial for improving performance in downstream tasks such as in medical contexts where data may be incomplete. Although some attempts have been made to effectively retrieve embeddings for missing modalities, two main bottlenecks remain: the consideration of both intra- and inter-modal context, and the cost of embedding selection, where embeddings often lack modality-specific knowledge. In response, we propose MoE-Retriever, a novel framework inspired by the design principles of Sparse Mixture of Experts (SMoE). First, MoE-Retriever samples the relevant data from modality combinations, using a so-called supporting group to construct intra-modal inputs while incorporating inter-modal inputs. These inputs are then processed by Multi-Head Attention, after which the SMoE Router automatically selects the most relevant expert, i.e., the embedding candidate to be retrieved. Comprehensive experiments on both medical and general multimodal datasets demonstrate the robustness and generalizability of MoE-Retriever, marking a significant step forward in embedding retrieval methods for incomplete multimodal data.
Full Text Available
